ENTERED

3 <110> APPLICANT: Betenbaugh et al.
5 <120> TITLE OF INVENTION: Engineering Intracellular Sialylation Pathways
7 <130> FILE REFERENCE: PF509P2
9 <140> CURRENT APPLICATION NUMBER: 09/930,440B
10 <141> CURRENT FILING DATE: 2001-08-16
12 <150> PRIOR APPLICATION NUMBER: 60/227,579
13 <151> PRIOR FILING DATE: 2000-08-25
15 <150> PRIOR APPLICATION NUMBER: 09/516,793
16 <151> PRIOR FILING DATE: 2000-03-01
18 <150> PRIOR APPLICATION NUMBER: 60/169,624
19 <151> PRIOR FILING DATE: 1999-12-08
21 <150> PRIOR APPLICATION NUMBER: 60/122,582
22 <151> PRIOR FILING DATE: 1999-03-02
24 <160> NUMBER OF SEQ ID NOS:
26 <170> SOFTWARE: PatentIn Ver. 2.1

28 <210> SEQ ID NO: 1
29 <211> LENGTH: 1429
30 <212> TYPE: DNA
31 <213> ORGANISM: Homo sapiens
33 <220> FEATURE:
34 <221> NAME/KEY: CDS
35 <222> LOCATION: (1)..(693)
37 <400> SEQUENCE: 1
38 atg gcc ttc cca aag aag aaa ctt cag ggt ctt gtg gct gca acc atc 48
39 Met Ala Phe Pro Lys Lys Lys Leu Gln Gly Leu Val Ala Ala Thr Ile
40 1 5 10 15

42 acg cca atg act gag aat gga gaa atc aac ttt tca gta att ggt cag 96
43 Thr Pro Met Thr Glu Asn Gly Glu Ile Asn Phe Ser Val Ile Gly Gln
44 20 25 30

46 tat gtg gat tat ctt gtg aaa gaa cag gga gtg aag aac att ttt gtg 144
47 Tyr Val Asp Tyr Leu Val Lys Glu Gln Gly Val Lys Asn Ile Phe Val
48 35 40 45

50 aat ggc aca aca gga gaa ggc ctg tcc ctg agc gtc tca gag cgt cgc 192
51 Asn Gly Thr Thr Gly Glu Gly Leu Ser Leu Ser Val Ser Glu Arg Arg
52 50 55 60

54 cag gtt gca gag gag tgg gtg aca aaa ggg aag gac aag ctg gat cag 240
55 Gln Val Ala Glu Glu Trp Val Thr Lys Gly Lys Asp Lys Leu Asp Gln
56 65 70 75 80

58 gtg ata att cac gta gga gca ctg agc ttg aag gag tca cag gaa ctg 288
59 Val Ile Ile His Val Gly Ala Leu Ser Leu Lys Glu Ser Gln Glu Leu
60 85 90 95

62 gcc caa cat gca gca gaa ata gga gct gat ggc atc gct gtc att gca 336
63 Ala Gln His Ala Ala Glu Ile Gly Ala Asp Gly Ile Ala Val Ile Ala
64 100 105 110

66 ccg ttc ttc ctc aag cca tgg acc aaa gat     ctg att aat ttc cta 384
67 Pro Phe Phe Leu Lys Pro Trp Thr Lys Asp Ile Leu Ile Asn Phe Leu
68 115 120 125

70 aag gaa gtg gct gct gcc gcc cct gcc         ttt tat tac tat cac 432
71 Lys Glu Val Ala Ala Ala Ala Pro Ala Leu Pro Phe Tyr Tyr Tyr His
72 130 135 140

74 att cct gcc ttg aca ggg gta aag att cgt     gag gag ttg tta gat 480
75 Ile Pro Ala Leu Thr Gly Val Lys Ile Arg Ala Glu Glu Leu Leu Asp
76 145 150 155 160

78 ggg att ctg gat aag atc ccc acc ttc caa ggg ctg aaa ttc agt gat 528
79 Gly Ile Leu Asp Lys Ile Pro Thr Phe Gln Gly Leu Lys Phe Ser Asp
80 165 170 175

82 aca gat ctc tta gac ttc ggg caa tgt gtt gat cag aat cgc cag caa 576
83 Thr Asp Leu Leu Asp Phe Gly Gln Cys Val Asp Gln Asn Arg Gln Gln
84 180 185 190

86 cag ttt gct ttc ctt ttt ggg gtg gat gag caa ctg ttg agt gct ctg 624
87 Gln Phe Ala Phe Leu Phe Gly Val Asp Glu Gln Leu Leu Ser Ala Leu
88 195 200 205

90 gtg atg gga gca act gga gca gtg ggc agt ttt gta tcc aga gat tta 672
91 Val Met Gly Ala Thr Gly Ala Val Gly Ser Phe Val Ser Arg Asp Leu
92 210 215 220

94 tca act ttg ttg tca aac tag gttttggagt gtcacagacc aaagccatca 723
95 Ser Thr Leu Leu Ser Asn
96 225 230

98 tgactctggt ctctgggatt ccaatgggcc caccccggct tccactgcag aaagcctcca 783
100 gggagtttac tgatagtgct gaagctaaac tgaagagcct ggatttcctt tctttcactg 843
102 atttaaagga tggaaacttg gaagctggta gctagtgcct ctctatcaaa tcagggtttg 903
104 caccttgaga cataatctac cttaaatagt gcattttttt ctcagggaat tttagatgaa 963
106 cttgaataaa ctctcctagc aaatgaaatc tcacaataag cattgaggta ccttttgtga 1023
108 gccttaaaaa gtcttatttt gtgaaggggc aaaaactcta ggagtcacaa ctctcagtca 1083
110 ttcatttcac agattttttt gtggagaaat ttctgtttat atggatgaaa tggaatcaag 1143
112 aggaaaattg taattgatta attccatctg tctttaggag ctctcattat ctcggtctct 1203
114 ggttcctaat cctattttaa agttgtctaa ttttaaacca ctataatatg tcttcatttt 1263
116 aataaatatt catttggaat ctaggaaaac tctgagctac tgcatttagg caggcacttt 1323
118 aataccaaac tgtaacatgt ctcaactgta tacaactcaa aatacaccag ctcatttggc 1383
120 tgctcagtct aactctagaa tggatgcttt tgaattcatt tcgatg 1429

123 <210> SEQ ID NO: 2
124 <211> LENGTH: 230
125 <212> TYPE: PRT
126 <213> ORGANISM: Homo sapiens
128 <400> SEQUENCE: 2
129 Met Ala Phe Pro Lys Lys Lys Leu Gln Gly Leu Val Ala Ala Thr Ile
130 1 5 10 15
131 Thr Pro Met Thr Glu Asn Gly Glu Ile Asn Phe Ser Val Ile Gly Gln
132 20 25 30
133 Tyr Val Asp Tyr Leu Val Lys Glu Gln Gly Val Lys Asn Ile Phe Val
134 35 40 45
135 Asn Gly Thr Thr Gly Glu Gly Leu Ser Leu Ser Val Ser Glu Arg Arg
136 50 55 60
137 Gln Val Ala Glu Glu Trp Val Thr Lys Gly Lys Asp Lys Leu Asp Gln
138 65 70 75 80
139 Val Ile Ile His Val Gly Ala Leu Ser Leu Lys Glu Ser Gln Glu Leu
140 85 90 95
141 Ala Gln His Ala Ala Glu Ile Gly Ala Asp Gly Ile Ala Val Ile Ala
142 100 105 110
143 Pro Phe Phe Leu Lys Pro Trp Thr Lys Asp Ile Leu Ile Asn Phe Leu
144 115 120 125
145 Lys Glu Val Ala Ala Ala Ala Pro Ala Leu Pro Phe Tyr Tyr Tyr His
146 130 135 140
147 Ile Pro Ala Leu Thr Gly Val Lys Ile Arg Ala Glu Glu Leu Leu Asp
148 145 150 155 160
149 Gly Ile Leu Asp Lys Ile Pro Thr Phe Gln Gly Leu Lys Phe Ser Asp
150 165 170 175
151 Thr Asp Leu Leu Asp Phe Gly Gln Cys Val Asp Gln Asn Arg Gln Gln
152 180 185 190
153 Gln Phe Ala Phe Leu Phe Gly Val Asp Glu Gln Leu Leu Ser Ala Leu
154 195 200 205
155 Val Met Gly Ala Thr Gly Ala Val Gly Ser Phe Val Ser Arg Asp Leu
156 210 215 220
157 Ser Thr Leu Leu Ser Asn
158 225 230

162 <210> SEQ ID NO: 3
163 <211> LENGTH: 1305
164 <212> TYPE: DNA
165 <213> ORGANISM: Homo sapiens
167 <220> FEATURE:
168 <221> NAME/KEY: CDS
169 <222> LOCATION: (1)..(1305)
171 <220> FEATURE:
172 <221> NAME/KEY: misc_feature
173 <222> LOCATION: (397)..(399)
174 <223> OTHER INFORMATION: The 'yat' at location 397..399 encodes amino acid His, or Tyr.
176 <220> FEATURE:
177 <221> NAME/KEY: misc_feature
178 <222> LOCATION: (406)..(408)
179 <223> OTHER INFORMATION: The 'gkt' at location 406..408 encodes amino acid Gly, or Val.
181 <220> FEATURE:
182 <221> NAME/KEY: misc_feature
183 <222> LOCATION: (439)..(441)
184 <223> OTHER INFORMATION: The 'yca' at location 439..441 encodes amino acid Pro, or Ser.
186 <220> FEATURE:
187 <221> NAME/KEY: misc_feature
188 <222> LOCATION: (505)..(507)
189 <223> OTHER INFORMATION: The 'gkt' at location 505..507 encodes amino acid Gly, or Val.
191 <400> SEQUENCE: 3
192 atg gac tcg gtg gag aag ggg gcc gcc acc tcc gtc tcc aac ccg cgg 48
193 Met Asp Ser Val Glu Lys Gly Ala Ala Thr Ser Val Ser Asn Pro Arg
194 1 5 10 15

196 ggg cga ccg tcc cgg ggc cgg ccg ccg aag ctg cag cgc aac tct cgc 96
197 Gly Arg Pro Ser Arg Gly Arg Pro Pro Lys Leu Gln Arg Asn Ser Arg
198 20 25 30

200 ggc ggc cag ggc cga ggt gtg gag aag ccc ccg cac ctg gca gcc cta 144
201 Gly Gly Gln Gly Arg Gly Val Glu Lys Pro Pro His Leu Ala Ala Leu
202 35 40 45

204 att ctg gcc cgg gga ggc agc aaa ggc atc ccc ctg aag aac att aag 192
205 Ile Leu Ala Arg Gly Gly Ser Lys Gly Ile Pro Leu Lys Asn Ile Lys
206 50 55 60

208 cac ctg gcg ggg gtc ccg ctc att ggc tgg gtc ctg cgt gcg gcc ctg 240
209 His Leu Ala Gly Val Pro Leu Ile Gly Trp Val Leu Arg Ala Ala Leu
210 65 70 75 80

212 gat tca ggg gcc ttc cag agt gta tgg gtt tcg aca gac cat gat gaa 288
213 Asp Ser Gly Ala Phe Gln Ser Val Trp Val Ser Thr Asp His Asp Glu
214 85 90 95

216 att gag aat gtg gcc aaa caa ttt ggt gca caa gtt cat cga aga agt 336
217 Ile Glu Asn Val Ala Lys Gln Phe Gly Ala Gln Val His Arg Arg Ser
218 100 105 110

220 tct gaa gtt tca aaa gac agc tct acc tca cta gat gcc atc ata gaa 384
221 Ser Glu Val Ser Lys Asp Ser Ser Thr Ser Leu Asp Ala Ile Ile Glu
222 115 120 125

224 ttt ctt aat tat yat aat gag gkt gac att gta gga aat att caa gct 432
225 Phe Leu Asn Tyr Xaa Asn Glu Xaa Asp Ile Val Gly Asn Ile Gln Ala
226 130 135 140

228 act tct yca tgt tta cat cct act gat ctt caa aaa gtt gca gaa atg 480
229 Thr Ser Xaa Cys Leu His Pro Thr Asp Leu Gln Lys Val Ala Glu Met
230 145 150 155 160

232 att cga gaa gaa gga tat gat tct gkt ttc tct gtt gtg aga cgc cat 528
-W--> 233 Ile Arg Glu Glu Gly Tyr Asp Ser Xaa Phe Ser Val Val Arg Arg His
234 165 170 175

236 cag ttt cga tgg agt gaa att cag aaa gga gtt cgt gaa gtg acc gaa 576
237 Gln Phe Arg Trp Ser Glu Ile Gln Lys Gly Val Arg Glu Val Thr Glu
238 180 185 190

240 cct ctg aat tta aat cca gct aaa cgg cct cgt cga caa gac tgg gat 624
241 Pro Leu Asn Leu Asn Pro Ala Lys Arg Pro Arg Arg Gln Asp Trp Asp
242 195 200 205

244 gga gaa tta tat gaa aat ggc tca ttt tat ttt gct aaa aga cat ttg 672
245 Gly Glu Leu Tyr Glu Asn Gly Ser Phe Tyr Phe Ala Lys Arg His Leu
246 210 215 220

248 ata gag atg ggt tac ttg cag ggt gga aaa tgg cat act acg aaa tgc 720
249 Ile Glu Met Gly Tyr Leu Gln Gly Gly Lys Trp His Thr Thr Lys Cys
250 225 230 235 240

252 gag ctg gaa cat agt gtg gat ata gat gtg gat att gat tgg cct att 768
253 Glu Leu Glu His Ser Val Asp Ile Asp Val Asp Ile Asp Trp Pro Ile
254 245 250 255

256 gca gag caa aga gta tta aga tat ggc tat ttt ggc aaa gag aag ctt 816
257 Ala Glu Gln Arg Val Leu Arg Tyr Gly Tyr Phe Gly Lys Glu Lys Leu
258 260 265 270

260 aag gaa ata aaa ctt ttg gtt tgc aat att gat gga tgt ctc acc aat 864
261 Lys Glu Ile Lys Leu Leu Val Cys Asn Ile Asp Gly Cys Leu Thr Asn
262 275 280 285

264 ggc cac att tat gta tca gga gac caa aaa gaa ata ata tct tat gat 912
265 Gly His Ile Tyr Val Ser Gly Asp Gln Lys Glu Ile Ile Ser Tyr Asp
266 290 295 300

268 gta aaa gat gct att ggg ata agt tta tta aag aaa agt ggt att gag 960
269 Val Lys Asp Ala Ile Gly Ile Ser Leu Leu Lys Lys Ser Gly Ile Glu
270 305 310 315 320

272 gtg agg cta atc tca gaa     gcc tgt tca aag cag acg cta         1008
273 Val Arg Leu Ile Ser Glu Arg Ala Cys Ser Lys Gln Thr Leu Ser Ser
274 325 330 335

276 tta aaa ctg gat tgc aaa atg gaa gtc agt gta tca gac aag tta gca 1056
277 Leu Lys Leu Asp Cys Lys Met Glu Val Ser Val Ser Asp Lys Leu Ala
278 340 345 350

280 gtt gta gat gaa tgg aga aaa gaa atg         tgc tgg aaa gaa gtg 1104
281 Val Val Asp Glu Trp Arg Lys Glu Met Gly Leu Cys Trp Lys Glu Val
282 355 360 365

284 gca tat ctt     aat gaa gtg tct gat gaa     tgc ttg aag aga gtg 1152
285 Ala Tyr Leu Gly Asn Glu Val Ser Asp Glu Glu Cys Leu Lys Arg Val
286 370 375 380

288 ggc cta agt ggc gct cct gct gat gcc tgt tcc tac gcc cag aag gct 1200
289 Gly Leu Ser Gly Ala Pro Ala Asp Ala Cys Ser Tyr Ala Gln Lys Ala
290 385 390 395 400

292 gtt gga tac att tgc aaa tgt aat ggt ggc cgt ggt gcc atc cga gaa 1248
293 Val Gly Tyr Ile Cys Lys Cys Asn Gly Gly Arg Gly Ala Ile Arg Glu
294 405 410 415

296 ttt gca gag cac att tgc cta cta atg gaa aaa gtt aat aat tca tgc 1296
297 Phe Ala Glu His Ile Cys Leu Leu Met Glu Lys Val Asn Asn Ser Cys
298 420 425 430

300 caa aaa tag 1305
301 Gln Lys

304 <210> SEQ ID NO: 4
305 <211> LENGTH: 434
306 <212> TYPE: PRT
307 <213> ORGANISM: Homo sapiens
309 <220> FEATURE:
310 <221> NAME/KEY: misc_feature
311 <222> LOCATION: (133)..(133)
312 <223> OTHER INFORMATION: The 'Xaa' at location 133 stands for His, or Tyr.
314 <220> FEATURE:
315 <221> NAME/KEY: misc_feature
316 <222> LOCATION: (136)..(136)
317 <223> OTHER INFORMATION: The 'Xaa' at location 136 stands for Gly, or Val.
319 <220> FEATURE:
320 <221> NAME/KEY: misc_feature
321 <222> LOCATION: (147)..(147)
322 <223> OTHER INFORMATION: The 'Xaa' at location 147 stands for Pro, or Ser.
324 <220> FEATURE:

Use of n and/or Xaa has been detected in the Sequence Listing.
Review the Sequence Listing to insure a corresponding explanation is presented in the <220> to <223> fields of each sequence using n or Xaa.

VERIFICATION SUMMARY

PATENT APPLICATION: US/09/930,440B
DATE: 02/13/2002
TIME: 15:24:23

Input Set : A:\PF509p2SecondSubSeqList.txt
Output Set: N:\CRF3\02132002\I930440B.raw

L:225 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:3
L:229 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:3
L:233 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:3
L:346 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4
L:348 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4
L:350 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4